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METHOD FOR PURIFYING KERATINOCYTE GROWTH FACTORS 

Field of thg Tnr-ir-i-n 

The present invention relates to the field of 
protein purification. Specifically, the present 
invention relates to the field of purifying keratinocyte 
growth factors. 



10 



Background of rh» t^,^,.^ 



15 



Polypeptide growth factors are imoprtant 
mediators of intercellular communication (Rubin et al 
(1989), Proc. itetl. Acad. Sci. USA, 55:802-806). These 
molecules are generally released by one cell type and 
act to influence proliferation of other cell types. 

One family of growth factors is the fibroblast 
growth factors (FGF) . There are currently eight known 
FGF family members which share a relatedness among 
20 primary structures: basic fibroblast growth factor 
bFGF (Abraham et al. (1986), EMBO J.. 5:2S23-2S28) ; 
acidic fibroblast growth factor. aFGF (Jaye et al ' 
(1986), Science. 221:541-545); int-2 gene product, int-2 
(Dickson & Peters (1987), Nature. 225:833); hst/kFGF 
25 (Delli-Bovi et al. (1987), Cell. 55:729-737, and 
Yoshida et al. (1987), Proc. Natl. Acad. Sci. USA, 
54:7305-7309); FGF-5 (Zhan et al. (1988) , Jfol. Cell. 
Biol., 5:3487-3495); FGF- 6 (Maries et al. (1989). 
Oncogene, 4:335-340); keratinocyte growth factor (Finch 
30 et al. (1989). Science. 24:752-755; Rubin et al. (1989), 
Proc. Natl. Acad. Sci. USA, ££ : 802-806; Ron et al. 
(1993), The Journal of Biological Chemistry, 
255141:2984-2988; and Van et al. (1991), In Vitro Cell. 
Dev. Biol., 22A:437-438); and hi sac tophi 1 in (Habazzettl 
35 et al. (1992), Mature, 255:855-858). 



WOW119S2 



PCT/US92/13099 



- 2 - 



10 



30 



35 



Among the FGF family of proteins, keratinocyte 
growth factor (KGF) is a unique effector of non- 
fibroblast epithelial (particularly keratinocyte) cell 
proliferation derived from mesenchymal tissues The 
term -native KGF- refers to a natural human (hKGF) or 
recombinant (rKGF) polypeptide (with or without a signal 
sequence) as depicted by the amino acid sequence 
presented in SEQ ID NO:2 or an allelic variant thereof 
[Unless otherwise indicated, amino acid numbering for 
molecules described herein shall correspond to that 
presented for the mature form of the native molecule 
(i.e., minus the signal sequence), as depicted by amino 
acids 32 to 194 of SEQ ID NO:2.] 

Native KGF may be isolated from natural 
15 sources. For example, hKGF can be isolated from medium 
conditioned by an embryonic lung fibroblast cell line 
(Rubin et al. (1989), supra. Three chromatographic 
steps, namely heparin-Sepharose" (Pharmacia, Piscataway, 
NJ) affinity chromatography, HPLC gel filtration, and 
reverse-phase HPLC, were used to obtain a purified hKGF 
preparation. Approximately 6 mg of hKGF were recovered 
from 10 liters of conditioned medium. These 
chromatographic steps only recovered 0.8% total hKGF 
based upon a autogenic activity assay. A further 
example teaches the use of another chromatographic step 
using heparin- Sepharose" affinity and Mono-S* ion- 
exchange chromatographys (Pharmacia, Piscataway, NJ) for 
isolation of rKGF produced in bacteria (Ron et al. 
(1993), Journal of Biological Chemistry, 2£S:2984-2988) . 

The properties of keratinocyte growth factors 
suggest a potential for the application thereof as a 
drug for promoting specific stimulation of epithelial 
cell growth, it therefore would be desirable to develop 
a method or methods for obtaining relatively high levels 
of homogeneous keratinocyte growth factors to provide 
sufficient quantities of material for comprehensive in 
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vitro and in vivo biological evaluation and for a 
potential therapeutic application. 

It is the object of this invention to provide 
a novel method for the purification of keratinocyte 
5 growth factors. 

Summary of rh^ IgygaSlfln 

The present invention is directed to a first 
10 method for purifying a keratinocyte growth factor (KGF) , 
the method comprising: 

a) obtaining a solution containing KGF; 

b) binding KGF from the solution of part (a) 
to a cation exchange resin; 

15 c > eluting KGF in an eluate solution from 

the cation exchange resin; 
d) passing the eluate solution from part (c) 
through a molecular weight exclusion 
matrix; and 

20 «) recovering KGF from the molecular weight 

exclusion matrix. 

The. invention is further directed to a second 
method for purifying a keratinocyte growth factor (KGF) , 
25 the method comprising: 

a) obtaining a solution containing KGF; 

b) binding KGF from the solution of part (a) 
to a cation exchange res in ; 

c) eluting KGF in an eluate solution from 
30 the cation exchange resin; 

d) performing hydrophobic interaction 
chromatography on the eluate solution of 
part (c) ; and 

e) recovering KGF from the hydrophobic 

35 interaction chromatography step of part 

(d). 
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5 between about 6.8-7.5. Sult^ ^If* ^ 

step include carbexyfflefchyl oJ^^^.«- 

™ rr u i ated agarose - ™- ^ 

(e.g. , columns of S-Sepharose Fast Flow- resinT! ™ 
resin and CM-cellulose- resin , n Mono-s™ 
10 from Pharmacia Pi sea T coa »^ally available 

va^-^rT Plscatawa y. NJ). The flow rat e will be 
variable depending upon the column size 

The gel filtration step of the «~«. 
conducts <„ » flrst method may be 

conducted in any suitable buffer (e a x. 
saline) at a «w «* * g " P^spbate buffer 

^ at a P H °f Preferably between about 7 0 , c 
IS Suitable columns for use in this step Sill 

based, acrylamide-based silica J! agarose- 

ea ' si-Lica-based or polvmei-->w M ,* ■ 
size-exclusion columns (e g eo i 1Mm . * P ° lymer - based 

resin and. Superdex-75- ~ ' ° f SePhadeX 6 " 7S " 

from PhaJclT ' e — available 

20 T 

la a particularly preferred embodiment of the 

STtTTLf" sulfhydryl ^ -* 

Prxor to the hydrophobic interaction step, discussed 
below - toy .anner of oxidation may be e^loyeT Tor 

25 oxygen for a suitable period of time. Altera* 
various oxidation procedures may be employed ^ L 

^ol^eLT™ — -^-ocytT^ 

™! , ° r 003:6 residues, as 

compared to the native kgf i 
30 reola^ , L molecule, are deleted or 

replaced. In this procedure an oxidizing agent (e g 
cystine dihydrochloride or another appropriate " ' 

c nctta: e on 0r ^ ° e ^ " * «»! 
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temperature at preferably between about 10-30 *C, for an 
appropriate period. The second procedure may be used 
for oxidizing native KGF and other keratinocyte growth 
factors with comparable patterns of cysteine residues. 
5 In this procedure, oxidation may be accomplished by 
adding an appropriate amount of an ionic strength 
modifier (e.g., (NH4)2S04)), adjusting the pH to 
preferably between about 7.5-9.5, and holding the 
temperature at preferably between about 23 ± 5'C for an 
10 appropriate period. 

The hydrophobic interaction step of the second 
method may be .conducted by using any suitable buffer 
(e.g., sodium phosphate) at a pH of preferably between 
about 6.0-8.0, more preferably about 7.0, and by eluting 
15 with a decreasing linear (NH4)2S04 gradient ranging from 
2-0 M. Suitable columns for use in this step include 
alkyl or phenyl substituted resins • (e.g.., a column of 
Butyl-650M Toyopearl** resin, commercially available from 
Tosohaas, Inc., Montgomeryville, PA and columns of 
20 phenyl Sepharose 1 * resin and phenyl Superose w resin, 
commercially available from Pharmacia) . 

The process of the present invention may be 
used to purify KGF. Thus, it should be understood that 
the terms "keratinocyte growth factor" and "KGF" as 
25 employed in this description ate intended to include, 
and to mean interchangeably unless otherwise indicated, 
native KGF and KGF analog proteins (or "muteins") 
characterized by a peptide sequence substantially the 
same as the peptide sequence of native KGF and by 
30 retaining some or all of the biological activity of 

native KGF, particularly non- fibroblast epithelial cell 
proliferation (e.g., exhibiting at least about 500-fold 
greater stimulation of BALB/MK keratinocyte cells than 
that of NIH/3T3 fibroblast cells, and at least about 50- 
35 fold greater stimulation of BALB/MK keratinocyte cells 
than for BS/589 epithelial cells or for CC1208 
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epithelial cells, as determined by H- thymidine 
incorporation) . By "characterized by a peptide sequence 
substantially the same as the peptide sequence of native 
KGF" is meant a peptide sequence which is encoded by a 
5 DNA sequence capable of hybridizing to nucleotides 201 
to 684 of SEQ ID NO:l, preferably under stringent 
hybridization conditions. 

The determination of a corresponding amino 
acid position between two amino acid sequences may be 

10 determined by aligning the two sequences to maximize 

matches of residues including shifting the amino and/or 
carboxyl terminus, introducing gaps as required and/or 
deleting residues present as inserts in the candidate. 
Database searches, sequence analysis and manipulations, 

15 may be performed using one of the well-known and 
routinely used sequence homology/ identity scanning 
algorithm programs (e.g., Pearson and Lipman (1988), 
Proc. Natl. Acad. Sci. U.S.A., £5:2444-2448; Altschul ec 
al. (1990), J. Mol. Biol., 215:403-410; Lipman and 

20 Pearson (1985), Science, 222:1435 or Devereux eC al. 
(1984), Nuc. Acids Res., 12:387-395). 

Stringent conditions, in the hybridization 
context, -will be stringent combined conditions of salt, 
temperature, organic solvents and other parameters 

25 typically controlled in hybridization reactions. 
Exemplary stringent hybridization conditions- are 
hybridization in 4 X SSC at 62-67° C, followed by 
washing in 0.1 X SSC at 62-67° C. for approximately an 
hour. Alternatively, exemplary stringent hybridization 

30 conditions are hybridization in 45-55% formamide, 4 X 
SSC at 40-45°C. [See, T. Maniatis et. al., Molecular 
Cloning (A Laboratory Manual) ; Cold Spring Harbor 
Laboratory (1982), pages 387 to 389]. 

Thus, the proteins include allelic variations, 

35 or deletion(s), substitution (s) or insertion(s) of amino 
acids, including fragments, chimeric or hybrid molecules 
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of native KGF. One example of KGF includes proteins 
having residues corresponding to Cys 1 and Cys 15 of SEQ ID 
NO: 2 replaced or deleted, with the resultant molecule 
having improved stability as compared with the parent 
5 molecule (as taught in commonly owned U.S. S.N. 

08/487,825, filed on July 7, 1995). Another example of 
KGF includes charge-change polypeptides wherein one or 
more of amino acid residues 41-154 of native KGF 
(preferably residues Arg 41 , Gin 43 , Lys 55 , Lys 95 , Lys 128 , 

10 Asn 137 , Gin 138 , Lys 139 , Arg 144 , Lys* 4 " 7 , Gin"*, Lys*53 or 

Thr 154 ) are deleted or substituted with a neutral residue 
or negatively charged residue selected to effect a 
protein with a reduced positive charge (as taught in 
commonly owned U.S. S.N. 08/323,337, filed on October 13, 

15 1994) . A still further example of KGF includes proteins 
generated by substituting at least one amino acid having 
a higher loop- forming potential for at least one amino 
acid within a loop-forming region of Asu^S-g^gHS- 
Tyr 117 -Asn^ 18 -Thr 115 of native KGF (as taught in 

20 commonly owned U.S. S.N. 08/323,473, filed on October 13, 
1994) . • A still yet further example includes proteins 
having one or more amino acid substitutions, deletions 
or additions within a region of 123-133 (amino acids 
154-164 of SEQ ID NO:2) of native KGF; these proteins 

25 may have agonistic or antagonistic activity. 

Specifically disclosed proteins include the 
following KGF molecules (referred to by the "residue 
found at that position in the mature protein (minus 
signal sequence) set forth in SEQ ID NO:2, followed by 

30 that amino acid position in parentheses and then either 
the substituted residue or to designate a deletion) : 
C(1,15)S, AN15-AN24, AN3/C(15)S, AN3/C(15)-, AN8/C(15)S, 
AN8/C(15)-, C(1,15)S/R(144)E, C (1, 15) S/R (144) Q, 
AN23/R(144)Q, C(1,15,40)S, C (1, 15, 102) S, 

35 C(1,15,102,106)S, AN23/N(137)E, AN23/K(139)E, 

AN23/K(139)Q, 4N23/RU44) A, M23/R(144)E, 4N23/R(144) L, 



WO 96/11952 



PCT/US92/13099 



AN23/K(147)E, AN23/K(147)Q, AN23/K(153)E, AN23/K(153)Q, 
AN23/Q(152)E/K(153)E; R(144)Q and H{116)G. 

As those skilled in the art will also 
appreciate, a variety of host-vector systems may be 
5 utilized to express the KGF protein-coding sequence. 
These include but are not limited to eucaryotic cell 
systems such as mammalian cell systems infected with 
virus (e.g., vaccinia virus, adenovirus, etc.); insect 
cell systems infected with virus (e.g., baculovirus) ; 

10 microorganisms such as yeast-containing yeast vectors; 
or to procaryotic cell systems such as bacteria 
transformed with bacteriophage DNA, plasmid DNA, or 
cosmid DNA. The expression elements of these vectors 
vary in their strengths and specificities . Depending .on 

15 the host-vector system utilized, any one of a number of 
suitable transcription and translation elements may be * 
used. 

Once the protein product of KGF expression has 
been isolated, purified and assayed for KGF activity 

20 (using procedures known to those skilled in the art) , it 
may be formulated in a variety of pharmaceutical 
compositions . Typically, such compositions include a 
suitable";" usually chemically-defined, carrier or 
excipient for the therapeutic agent and, depending on 

25 the intended form of administration, other ingredients 
as well. The composition can include aqueous carriers 
or consist of solid phase formulations in which KGF is 
incorporated into non-aqueous carriers such as 
collagens, hyaluronic acid, and various polymers. The 

30 composition can be suitably formulated to be 

administered in. a variety of ways, including by 
injection, orally, topically, intranasally and by 
pulmonary delivery. 
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flr jof BsassiBfiioa of the Drawings 

Figure 1 shows the nucleotide (SEQ ID N0:1) 
and amino acid (SEQ ID NO :2) sequences of native KGF 
5 (the nucleotides encoding the mature form of native KGF 
is depicted by bases 201 to 684 of SEQ ID NO:l and the 
mature form of KGF is depicted by amino acid residues 32 
to 194 of SEQ ID NO:2) . 

Figures 2A, 2B and 2C show the plasmid maps of 
10 pCFM1156/.pCFM1656 and pCFM3102, respectively. 

Figure 3 shows the nucleotide (SEQ ID NO: 3) 
and amino acid (SEQ ID NO:4) sequences of theconstruct 
RSH-KGF. 

Figure 4 shows the nucleotide (SEQ ID NO: 5) 
15 and amino acid (SEQ ID NO: 6) sequences of the construct 
contained in plasmid KGF. 

Figure 5 shows the chemically synthesized 
OLIGOs (OLIGO#6 through OLIGO#ll; SEQ ID NO:12-17, 
respectively) used to substitute the DNA sequence 
20 between a Kpnl site and an EcoRI site for a Kpnl site 
(from amino acid positions 46 to 85 of SEQ ID No: 6) in 
the construct contained plasmid KGF to produce the 
construct in plasmid KGF(dsd). 

Figure 6 shows the chemically synthesized 
25 OLIGOs (OLIGO#12 through OI*IGO#24; SEQ ID NO:18-30, 
respectively) used to construct KGF(codon optimized). 

Figure 7 shows the nucleotide (SEQ ID NO:31) 
and amino acid sequences (SEQ ID NO:32) of C(1,15)S, a 
KGF analog having substitutions of serine for cysteine 
30 at and no acid positions 1 and 15 of native KGF. 

Figure 8 shows the nucleotide (SEQ ID NO: 33) 
and amin o acid sequences (SEQ ID NO: 34) of 
C(1,15)S/R(144)E, a KGF analog having substitutions of 
serine for cysteine at amino acid positions 1 and 15 and 
35 a substitution of glutamic acid for arginine at amino 
acid position 144 of native KGF. 
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Figure 9 shows the nucleotide {SEQ ID NO: 35) 
and amino acid (SEQ ID NO: 36) sequences of 
C(1,15)S/R(144)Q, a KGF analog having substitutions of 
serine for cysteine at amino acid positions 1 and 15 and 
5 a substitution of glutamine for arginine at amino acid 
position 144 of native KGF. 

Figure 10 shows the nucleotide (SEQ ID NO: 37) 
and amino acid (SEQ ID NO:38) sequences of AN15, a KGF 
analog having a deletion of the first 15 amino acids of 
10 the- N-terminus of native KGF. 

Figure 11 shows the nucleotide (SEQ ID NO: 39) 
and amino acid (SEQ ID NO:40) sequences of AN23, a KGF 
analog having a deletion of the first 23 amino acids of 
the N- terminus of native KGF, 
15 Figure 12 shows the nucleotide (SEQ ID NO: 41) 

and amino acid (SEQ ID NO:42) sequences of AN23/R(144)Q, 
a KGF analog having a deletion of the first 23 amino 
acids of the N-terminus and a substitution of glutamine 
for arginine at amino acid position 144 of native KGF. 

20 

Description of Specific Effifegfljfflgnfca 

: : Standard methods for many of the procedures 
described in the following examples, or suitable 

25 alternative procedures, ore provided in widely 

recognized manuals of molecular biology such as, for 
example, Molecular Cloning, Second Edition, Sambrook et 
al., Cold Spring Harbor Laboratory Press (1987) and 
Current Protocols in Molecular Biology, Ausabel et al., 

30 Greene Publishing Associates /Wiley- Interscience, New 
York (1990).. 

ExailPlf 1 : Preparation of DNA Coding for KGF and KGF Analogs 



35 



The cloning of the f till- length human KGF gene 
(encoding a polypeptide with the sequence of native KGF) 
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was carried out both by polymerase chain reaction (PGR) 
of RNA from an animal cell and by PGR of chemically 
synthesized {£. coli optimized codon) oligonucleotides 
("OLIGOs") • Both procedures are described below: 
5 PCR amplification using RNA isolated from 

cells known to produce the polypeptide was performed. 
Initially, cells from a human fibroblast cell line 
AG1523A (obtained from Human Genetic Mutant Cell Culture 
Repository Institute For Medical Research, Camden, New 

10 Jersey) were disrupted with guanidium thiocyanate, 
followed by extraction (according to the method of 
Chorayzinski et al. (1987), Anal. Biochem. , 172 :156) . 
Using a standard reverse transcriptase protocol for 
total RNA, the KGF cDNA was generated. PCR (PCR#1) 

15 amplification of the KGF gene was carried out using the 
KGF cDNA as template and primers OLIGO#l and OLIGO#2 
that encode DNA sequences immediately 5 1 and 3 ' of the 
KGF gene [model 9600 Thexmocycler (Perkin-Elmer Cetus, 
Norwalk, CT) ; 28 cycles; each cycle consisting of one 

20 minute at 94°C for denaturation. two minutes at 60°C for 
annealing, and three minutes at 72°C for elongation] . A 
small aliquot of the PCR#1 product was then used as 
template for a second KGF PCR (PCR#2) amplification 
identical to the cycle conditions described above except 

25 for a 50°C annealing ten^eratute . For expression 

cloning of the KGF gene, nested PCR primers were used to 
create convenient restriction sites at both ends of the 
KGF gene. OLIGO#3 and OLIGO#4 were used to modify the 
KGF DNA product from PCR#2 to include Mlul and BamHI 

30 restriction sites at the 5' and 3' ends of the gene, 

respectively [PCR#3; 30 cycles; each cycle consisting of 
one minute at 94°C for denaturation, two minutes at 60°C 
for annealing, and three minutes at 72°C for 
elongation] . This DNA was subsequently cut with Mini 

35 and BamHI, phenol extracted and ethanol precipitated. 
It was then resuspended and ligated (using T4 ligase) 
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into a pCFM1156 plasmid (Figure 2A) that contained a 
•RSH" signal sequence to make construct RSH-KGF (Figure 
3) The ligation products were transformed (according 
to "the method of Hanahan (1983), J. Mol. Biol.. 151:557) 
into E. coli strain FMS (ATCC: 53911) and plated onto 
LB+kanamycin at 28°C. Several transformants were 
selected and grown in small liquid cultures containing 
20 |ig/mL kanamycin. The RSH-KGF plasmid was isolated 
from the cells of each culture and UNA sequenced. 
Because of an internal Ndel site in the KGF gene, it was 
not possible to directly clone the native gene sequence 
into the desired expression vector with the bracketed 
restriction sites of Ndel and BamHI. This was 
accomplished as a three-way ligation. Plasmid RSH-KGF . 
was cut with the unique restriction sites of BsmI and 
SstX, and a -3 kbp UNA fragment (containing the 3 ■ end • 
of the KGF gene) was isolated following electrophoresis 
through a 1% agarose gel. A PCR <PCR#4) was carried out 
as described for PCR#3 except for the substitution of 
OLIGO#5 for OLIGO#3 . The PCR DNA product was then cut 
with Ndel and BsmI and a 311 bp DNA fragment was 
isolated following electrophoresis through a 4% agarose 
gel. The third fragment used in the ligation was a 1.8 
kbp DNA fragment of pCFMll56 cut with Ndel and SstJ 
isolated following electrophoresis through a 1% agarose 
gel. Following ligation <T4 ligase) , transformation, 
icanamycin selection and DNA sequencing as described 
above; a clone was picked containing the construct in 
Figure 4. and the plasmid designated KGF. Because of an 
internal ribosomal binding site that produced truncated 
products, the KGF DNA sequence between the unique JQpnl 
and EcoRJ sites^s replaced with chemically synthesized 
OLIGOs (0LIGC46 through OLIGOWl) to minimize the use of 
the internal start site (Figure 5) . 



35 
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OLIGOtl (SZQ ID NO:7) : 5 * -CAATGACCTAGGAGTAACAATCAAC-3 9 

OLIGO#2 (SEQ ID NO: 8) : S ' -AAAACAAACVTAAATCCACAAGTCCA-3 ' 

OLIGO#3 <SEQIDNO:9): 5 1 -ACAACGCGTGCAATGACATGACTCCA-3 * 

OLIGO#4 {SEQ ZD NO:10) : 

5 5 ' -ACAGGATCCTATTAAGTTATTGCCATAGGAA-3 ' 

OLIGO#5 (SZQ ID NO: 11): 

5 • -ACACATATGTGCAATGACATGACTCCA-3 • 
OLIGO#6 (SEQ ID NO:12) : 

5 ' -CTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCC-3 ■ 
10 OLIGO#7 (SEQ ID NO:13) : 

5 ' -AAGAGATGAAAAACAACTACAATATTATGGAAATCCGTACTGTT-3 • 
OLIGO#8 (SEQ ID NO:14): 

5 ■ -GCTGTTGGTATCGTTGCAATCAAAGGTGTTGAATCTG- 3 • 
OLIGO#9 (SEQ ID NO:15): 

15 5 ' •TCTTGGGTGCCCTTGACTTTGCCGCGTTTGTCGATACGCAGGTAC- 3 ■ 

OLIGO#10 (SEQ ID NO:16) : 

5 1 - ACAGCAACAGTACGGATTTCCATAATATTGTAGTTGTTTTTCATC- 3 • 
OLIGOtll (SEQ ID N0:17) : 

5 ' -AATTCAGATTCAACACCTTTGATTGCAACGATACCA-3 • 

20 

The OLIGOs were phosphorylated with T4 
polynucleotide kinase and then heat denatured. The 
single-strandeS <ss) OLIGOs were then allowed to form a 
ds DNA fragment by allowing the temperature to slowly 
25 decrease to room temperatur e . *T4 ligase was then used 
to covalently link both the internal OLIGO sticky-ends 
and the whole ds OLIGO fragment to the XGF plasmid cut 
with Kpnl and EcoRI. The new plasmid was designated 
KGF(dsd) . 

30 A completely E. coli codon-optimized KGF gene 

was constructed by PGR amplification of chemically 
synthesized OLIGOs #l2 through 24. 
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0LIG0#12 (SEQ ID NO: 18 ) : 5 * -AOTTTGATCTAGAAGGAGG-3 1 
OLIGO#13 ( SEQ ID NO : 19 ) : 5 1 -TCAAAACT03ATCCTATTAA-3 ' 
OLIG0414 (SEQ ID NO:20) : 

5 1 -ACTTTTTGATCTAGAAGGAGGAATAACATATCra 

5 TCCGGAACAGATGGCTACCAACGTTAACTGCTCCAGCCCGGAACGT-3 • 

OLIGO#lS (SEQ ID NO:21) : 

5 • -CACACCCGTAGCTACGACTACATGGAAGGTGGTGACATCCGTGTTC- 
GTCGTCTGTT C T G CCGTACCCAGTGGTACCTGCGTATCGACAAA-3 ' 
OLIGO#16 (SEQ ID NO:22) : 
10 5 ' -CGTGGTAAAGT7AAAGGTACCCAGGAAATGAAAAACAACTA* 

CAACATCATGGAAATCCCTACTGTT GC T G TTGCT 3 ' 

0LIG0U7 [SZQ ID NO: 23 ) : 

5 ' *GGTGTTGAATCTGAATTCTACCTGGCAATGAACAAAGAAGGTAAAC - 
TGTACGCAAAAAAAGAATGCAACGAAGACTGCAACTTCAAAGAA-- 3 * 
15 OLIGO»18 (SEQ ID NO: 24) : 

S 1 -CTGATCCTGGAAAACCACTACAACACCTACGCATCTG . 
CCCACAACGGTGGTGAAA7GTTCGTTGCTCTGAACCAGAAAGGT"3 ' 
OLIGO*19 (SEQ ID Np:25) : 

5 ' -ATCCCGGTTCGTGGTAAAAAAACCAAAAAAGAACAGAAAACCGCT- 
2 0 CACTTCCTGCCGATGGCAATCACTTAATAGGATCCA G TTTT G A- 3 ' 

OLIGO#20 (SEQ ID NO : 26 ) : 5 * -TACGGGTGTGACGTTCCGGG-3 * 
OLIGO# 21 ( SEQ ID NO : 27 ) : 5 • -CTTTACCACGTTTGTCGATA- 3 ' 
OLIGO#22 . (SEQ ID NO:28) : 5 * -ATTCAACACCnTGATTGCA-3 ' 
OLIGO#23 (SEQ ID NO : 29 ) : 5 ' -CCAGGATCAGTTCTTTGAAG-3 ' 
25 OLIGO#24 (SEQ ID NO : 3 0 ) : 5 * -rGAACCGGGATACCTTTCTGG-3 1 

OLIGOs #12 through 24 were " designed so that 
the entire DNA sequence encoding native KGF was 
represented by OLIGOs from either the "Watson - or the 

30 "Crick" strand and upon PGR amplif ication would produce 
the desired double-stranded DNA sequence (Figure 6) 
[PCR#5, Model 9&00 thermo cycler (Per kin-Elmer Cetus); 21 
cycles, each cycle consisting of 31 seconds at 94°C for 
denaturation, 31 seconds at 50°C for annealing, and 31 

35 seconds at 73°C for elongation; following the 21 cycles 
the PCR was finished with a final elongation step of 7 
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n^utesj. After PCR amplification, the DMA fragment ^ 
cut wxth Xtal and Ba.Hr and the 521 bp fragment^^ 
mto the expression plasmid pCFmi56 cut X 
enzymes. PCR#5 utilized the outside primers 
5 (100 paoles/100 pi rxn) OLIGO#i 2 and OLIGO#13 and 

'SrW 1 T °/ a KGF ten9late derived * 

(by T4 Ixgase) of OLIGOs #14 through #19 (OLIGOs#15 
through OLIGOs#18 were phosphorylated with T4 

10 e? H Cid f. kinaSe, ^ «*«20 through OLIGOs#24 

10 as band-aid oligos (Jayaraman et ml. (1992) 

BioCecUnigues. 12:392) for the ligation. The final 
construct was designated KGFfcodon optimized)'. 

All of the KGF analogs described herein are 

IS KGF(codon optimized,, or a combination of the two Thl 
sequences are further modified by the insertion into 
convenient restrictions sites of DMA sequences that 
encode the particular KGF analog amino acids made 

20 for^ " m ° re ° f *" techniques 
20 for DMA. fragment synthesis. Any of the analogs can be 
generated in their entirety by either of the above 

TrZ^ tSCiBliqUeS - Howev ~< as a part of the general 
OLIGO design optimized E. coli codons were used where 
appropriate, although the presence of E. coll optimized 
codons in part or in toto of any of the genes where 
examined did not significantly increase the yield of 
protein that could be obtained from cultured bacterial 
cells. Figures 7 to 12 set forth by convenient example ' 
particular KGF analog nucleotide and amino acid sequence 
30 constructions: C(1,15)S (Figure 7) , C(1,15>S/R(144)E 
(Figure 8); C(1.15)S/R(144)Q (Figure 9); AMIS (Figure 
10); ON23 (Figure 11, and aN23/R(144)Q (Figure 12) All 
the KGF analog constructions described herein were DMA ' 
sequence confirmed. 
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pyayrrole 2 1 Purification from E. coli 

Three different egression plasmids were 
5 utilized in the cloning of the KGF analog genes. They 
were pCFM1156 (ATCC 69702), pCFM1656 (ATCC 69576), and 
pCFK3102 (Figures 2A, 2B and 2C, respectively) . The 
plasmid p3102 can be derived from the plasmid pCFM1656 
by making a series of site directed base changes with 
10 PCR". overlapping oligo mutagenesis. Starting with the 
Bglll site (pCFM1656 plasmid bp #180) immediately 5* to 
the plasmid replication promoter, PcopB,' ™d proceeding 
toward the plasmid replication genes, the base pair 
changes are as follows: 

15 
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PCFM1656 OP # bP in PCTM1S56 bn changed to in nrm^^ 







# 204 


T/A 


C/G 






# 428 


A/T 


G/C 






# 509 


G/C 


A/T 


5 




# 617 




insert two G 






# 677 


G/C 


T/A 






# 978 

IT J 1 Sj 


T/A 


C/G 






# 992 


G/C 


A/T 






# 1002 


A/T 


C/G 


10 




# 1005 


C/G 


T/A 






# 1026 


A/T 


T/A 






# 1045 


C/G 


T/A 






# 1176 


G/C 


T/A 






# 1464 •■ 


G/C 


T/A 


15 




# 2026 


G/C 


bp deletion 






# 2186 


C/G 


T/A- 






# 2479 


A/T 


T/A 




# 


2498-2501 


AGTG 


GTCA 


20 






TCAC 


CAGT 




# 


2641-2647 


TCCGMC 


bp deletion 








AGGCTCG 




25 


# 


3441 


G/C 


A/T 




# 


3452 


G/C 


A/T 




# 


3649 


A/T 


T/A 




ft 


4556 




insert bps 



30 (SEQIDNO:43) 5 ' -GAGCTCAOTAGTGTCGACCTGCAG>3 ' 

(SEQ ID NO: 44) 5 ' -CTCGAGTGATCACAGCTGGACGTC - 3 1 

As seen above, pCFM1156, pCHH656 and pCFM3102 
are very similar to each other and contain many of the 

35 same restriction sites. The plasmids were chosen by 

convenience, and the vector DMA con?>onents can be easily 
exchanged for purposes of new constructs. The host used 
for all cloning was E. coli strain FM5 (ATCC: 53911) and 
the transformations were carried out (according to the 

40 method of Hanahan (1983), supra) or by electroelution 
with a Gene Pulser** transfection apparatus (BioRad 
Laboraties, Inc., Hercules, CA) , according to the 
manuf acturer 1 s protocol. 

Initially, a small, freshly-cultured inoculum 

45 of the desired recombinant £. coli clone harboring the 
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desired construct on one of the three pCFM vectors was 
started by transferring 0.1 mL of a frozen glycerol 
stock of the appropriate strain into a 2 L flask 
containing 500 mL of. Luria broth. The culture was 
shaken at 30'C for 16 hours. Thereafter the culture was 
transferred to a 15 L fennentor containing 8 L of 
sterile batch medium (Tsai, et al. (1987), j. irdustrial 
Microbiol., 2:181-187). 

Feed batch fermentation starts with the 
feeding of Feed # l medium (Tsai. ec al. (1987.), 
supra) . When the OD600 reached 35, expression of the 
desired KGF analog was induced by rapidly raising the 
culture temperature to 37'C to allow the amplification 
of plasmid. After two hours at 37'C, the culture 
temperature was quickly raised to 42 'C to denature the' 
CI repressor and the addition of Feed 1 was discontinued 
in favor of Feed 2. the addition rate of which was 
initiated at 300 mL/hr. Feed 2 comprised 175 g/L 
trypticase-peptone, 87.5 g/L yeast extract, and 260 g/L 
20 glucose. After one hour at 42 -C. the culture 
temperature was decreased to 36*C. where this 
• temperature was then maintained for another 6 hours. 

- The fermentation was then halted and the cells 
were harvested by centrifugation into plastic bags 
25 placed within 1 L centrifuge bottles. The cells were 
pelleted by centrifugation at 400 rpm for 60 minutes, 
after which the supernatants were removed and the cell 
paste frozen at -90 *C. 



IS 



Following expression of the various KGF 
30 analogs in E. coli, native KGF, C(1,15)S, 

C(1,15)S/R(144)E, C(1,15)S/R(144)Q, AN15, M22 , and 
AN23/R(144)Q protein were purified using the following 
procedure. Cell paste from a high cell density 
fermentation was suspended at 4"C in 0.2 M NaCl, 20 mM 
NaP0 4 , pH 7.5 as a 10-20% solution (weight per volume) 
using a suitable high shear mixer. The suspended cells 



35 
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were then lysed by passing the solution through a 
hcmogenizer (APV Gaulin, Inc., Everett, MA) three times. 
The outflowing homogenate was cooled to 4-8 # C by using a 
suitable heat exchanger. Debris was then removed by 
5 centrifuging the lysate in a J-6&" centrifuge (Beckman 
Instruments, Inc., Brea, CA) equipped with a JS 4.2 
rotor at 4,200 rpm for 30-60 min. at 4 # C. Supernatants 
were t he n carefully decanted and loaded onto a 
previously prepared 450 mL (5 cm x 23 cm) column of S- 

10 Sepharose Fast Flow™ resin (Pharmacia) column 

equilibrated with 0.2 M NaCl, 20 mM NaP0 4 , pH 7.5 at 
4*C. Next, the column was washed with five column 
volumes {2250 mL) of 0.4 M NaCl, 20 mM NaF0 4 , pH 7.5 at 
4*C. The desired protein was eluted by washing the 

15 column with 5 L of 0.5 M NaCl, 20 mM NaP0 4 , pH 7.5. 

Again, 50 mL fractions were collected and the A28O of the 
effluent was continuously monitored. Fractions 
identified by A28O as containing eluted material were 
then analyzed by SDS-PAGE through 14% gels to confirm 

20 the presence of the desired polypeptide. 

Those fractions containing proteins of 
interest were then pooled, followed by the addition of 
an equal volume of distilled water. The diluted sample 
was then loaded onto a previously prepared 450 mL (5 cm 

25 x 23 cm) column of S-Sepharose' Fast Flow equilibrated 

with 0.4 M NaCl, 20 mM NaP0 4 , pH 6.8 at 4*C. The column 
was washed with 2250 mL of 0.4 M NaCl, 20 mM NaP0 4 , pH 
6.8 and the protein eluted using a 20 column volume 
linear gradient ranging from 0.4 M NaCl, 20 mM NaP0 4 , pH 

30 6.8 to 0.6 M NaCl, 20 mM NaP0 4 , pH 6.8. Again, 50 mL 
fractions were collected under constant A28O monitoring 
of the effluent. Those fractions containing the protein 
(determined by 14% SDS-PAGE) were then pooled, followed 
by concentration through a YM-10 membrane (10,000 

35 molecular weight cutoff) in a 350cc stirring, cell 

(Amicon, Inc. Mayberry, MA) to a volume of 30-40 mL. 
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The concentrate was then loaded onto a 
previously generated 1,300 mL (4.4 cm x 85 cm) column of 
Superdex-75» resin (Pharmacia) equilibrated in col umn 
buffer comprising IX PBS (Dulbecco's Phosphate Buffered 
5 Saline, "D-PBS, • calcium and magnesium- free) or 0.15 M 
NaCl, 20 mM NaP0 4 , pH 7.0. After allowing the sample to 
run into the column, the protein was eluted from the gel 
filtration matrix using column buffer. Thereafter, 
10 mL fractions were recovered and those containing the 
10 analog (determined by 14% SDS-PAGE) were pooled. 

Typically, the protein concentration was about 5-10 
mg/mL in the resultant pool. All of the above 
procedures were performed at 4-8 *C, unless otherwise 
specified. 

• 15 An alternative purification procedure was used 

to purify native KGF, C(1,15)S and AN23. The procedure 
involves the following steps and, unless otherwise 
specified, all procedures, solutions and materials were 
conducted at 23 + 5'C. 

20 Upon completion of the production phase of a 

bacterial fermentation, the cell culture was cooled to 
4-8 *C and the cells were harvested by centrifugation or 
a similar- process . On the basis of the expected yield 
of protein per unit weight of cell paste and the amount 

25 of purified protein required, an appropriate amount of 
cell paste, by weight, was suspended in a mild buffer 
solution, 20 mM NaP04, 0.2 M NaCl, pH*7.5, weighing 
about five times that of the cell paste to be suspended. 
The cells were dispersed to a homogeneous solution using 

30 a high shear mixer. The temperature of the cell paste 
dispersion was maintained at 4-8 *C during 
homogeni zat ion . 

The cells were then lysed by pressure, for 
example by passing the cell paste dispersion twice 

35 through an appropriately sized cell homogenizer. The 
homogenate was kept chilled at 5 + 3°C. To clarify the 
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cell lysate, a previously prepared depth filter housing 
(Cuno, lac, Meriden, CT) equipped with a filter having 
an appropriate amount of filter surface area, 
equilibrated with a suitable volume of 0.2 M NaCl, 20 mM 
5 NaP0 4 , pH 7.5 was employed. The equilibration and * 
clarification were performed at 5 ± 3*C. Prior to 
clarification, an appropriate amount of a suitable 
filter aid was used to pre-coat the filter and be 
thoroughly mixed with the cell lysate, after which the 

10 lysate was clarified by passing the solution through the 
filter apparatus. The filter was washed with 0.2 M 
Nad, 20 mM NaP0 4 , pH 7.5. The filtrate and any 
subsequent wash were collected in a chilled container of 
suitable capacity, all the while being maintained at 

15 less than 10 *C. 

Following clarification the lysate was then 
passed through a previously prepared column of 
SP-Sepharose Fast Flow containing at least 1 mL of resin 
per 2 g of cell paste. The column of SP-Sepharose Fast 
20 Flow was equilibrated with cold {5 ± 3'C), 0.2 M NaCl, 
20 mM NaP0 4 . pH 7.5. The temperature of the column was 
maintained at less than 10 'C. The clarified lysate 
(5 ± 3-C) was ^then loaded onto the ion exchange column, 
with the absorbance at 280 nm (A 280 ) of eluate being 
25 continuously monitored. After' sample loading, the 

column was washed with cold 0.2 M NaCl. 20 mM NaP0 4 , • 
PH 7.5, followed by washing with 0.3 M NaCl, 20 mM 
NaP0 4 , pH 7.5 at 23 ± 5*C. 

To elute the desired protein, a linear 
gradient ranging from 0.2-1 M NaCl, 20 mM NaP0 4 , pH 7.5 
was used. Bulk product was collected in several 
fractions on the basis of the A 280 of the eluate. 
Following elution, these fractions were pooled and the 
volume noted. 

To oxidize free sulfhydryl groups, an 
oxidation step was performed. For proteins with altered 
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cysteine patterns, as compared to native KGF, an 
oxidizing agent (e.g., cystamine dihydrochloride or 
another appropriate oxidizing agent, for instance, 
cystine, oxidized glutathione or divalent copper) was 
5 added to a final concentration of 1-20 mM and the pH was 
adjusted to 7-9.5, with a pH of 9.0 ± 0.3 when cystamine 
dihydrochloride was used. The oxidation was conducted 
at 10 - 30 *C for an appropriate period. For the native 
KGF protein, oxidation was accomplished by adding an 

10 appropriate amount of (NH4)2S04 such as 1-2 M (NH4)2S04, 
adjusting the pH to 7.5-9.5, and holding the temperature 
at 23 ± 5*C for an appropriate period. 

After oxidation, the pH of the solution was 
adjusted to between 6.5 and 9.5. If necessary, solid . 

15 (NK4)2S04 was added to the solution to a final 

concentration of 2 M. To remove particulates, the 
solution was passed through appropriate clarification 
filters . 

The filtered, oxidized product was then 

20 subjected to hydrophobic interaction chromatography 

(HIC) . The HIC matrice was Butyl-650M Toyopearl» resin 
(Tosohaas, Inc., Montgomeryville, PA). The protein- 
containing solution was loaded onto the column, which 
had been previously equilibrated with 2 M {NH4)2S04, 0.15 

25 M NaCl, 20 mM NaP0 4 , pH 7».0. After sample loading, the 
column was washed with 2 M (NH4)2S0 4 , 0.15 M NaCl, 20 mM 
NaP0 4 , pH 7.0. The desired protein was then eluted 
using a decreasing linear CNH4)2S04 gradient ranging from 
2-0 M developed in 0.15 M NaCl, 20 mM NaP04, pH 7.0. 

30 When the desired protein began to elute, as indicated by 
an increase in the A 2 so of the eluate, fractions were 
collected. Aliti^ots of each fraction were then analyzed 
by SDS-PAGE. Those fractions containing the desired 
protein were then pooled, thoroughly mixed, and the 

35 volume of the pool determined, as was the concentration 
of the protein therein. 
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The pooled HIC protein-containing eluate was 
then concentrated and the elution buffer exchanged. 
Typically, proteins were concentrated to 5.0-10.0 mg/mL. 
Ultrafiltration was conducted using an ultrafiltration 
5 system equipped with a Pellicon** cassette system 

(Millipore, Inc., Bedford, MA) with an appropriately 
sized cut-off membrane 

After concentration, the sample was 
diafiltered against an appropriate buffer. The 
10 retentate from the concentration step was diafiltered 
against 0.15 M NaCl, 20 mM NaP04, pH 7.0 until the 
conductivity of the retentate was within 5% of the 
conductivity of the 0.15 M NaCl, 20 mM NaP0 4 , pH 7.0 
solution. 

15 In addition, to remove precipitates and 

bacterial endotoxin that might be present, the 
concentrated diafiltered protein-containing sample was 
passed through a 0.1 \sm Posidyne» filter (Pall, Inc., 
Cortland, NY) . After determining the protein 

20 concentration of the solution and on the basis of the 
desired concentration of the final, bulk product, the 
solution was diluted with 0.15 M NaCl, 20 mM sodium 
phosphate, pH 7.0, to the desired final concentration. 
A final aseptic filtration through a 0.22 pm filter, was 

25 then performed as the final bulk product was transferred 
to a pyrogen- free container for storage (at about 5*C) 
for further formulation. 



Example 3 : Purification from Mammalian Cell Culture 

30 

This example describes the exuression, 
isolation, and characterization of two biologically 
active recombinant KGF (rKGF) forms produced in a 
m amma lian expression system. 
35 The human KGF gene was isolated by PCR 

amplification of cDNA made from normal dermal human 
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fibroblast cells (Clonetec, Inc., Palo Alto, CA) . 
Following the making of cDNA by reverse transcriptase, 
PGR was used to amplify the KGF gene. 0LIG0#2S and 
OLIGO#26 were used to amplify the gene out of the cDNA 
5 and OLIGO#27 and OLIGOS28 were used to place HindXIX and 
Bglll restriction sites at the fragment ends by a second 
PGR amplification, as set forth in Figure 1. 



0LIG0#2S (SEQ ID NO: 45) 
10 6£lCO«26 (SEQ ID N0:46) 

OLI<30#27 (SEQ ID NO: 47) 
OLIGO#28 (SEQ ID NO: 48) 



S • -CAATCTACAATTCACAGA-3 ' 

S • -TTAAGTTATTGCCATAGG-3 ' 

S • -AACAAAGCTTCTACAATTCACAGATAGGA-3 ' 

5 • -AACAAGATCTTAAGTTATTGCCATAGG-3 ' 



Following cloning and DNA sequence 
15 confirmation, the KGF gene DNA was then used. 
Amplification was effected using two primers: 

OLIGO#29 (SEQ. ID. N0:49): 

5 ' -CGGTCTAGACCACCATGCACAAATGGATACTGACATGG— 3 

20 OLIGOt30 (SEQ. ID. NO-.50) : 

S • -GCCGTCGACCTATTAAGTTATTGCCATAGGAAG-3 ' 

The sense primer, OLIGO#29, included an Xbal 
site and" a consensus Kozak translation sequence <5'- 

25 CCACC-3') upstream of the start codon. ATG The 

antisense primer, OLIGO#30, included a Sail clonmg 
site and an additional stop codon. After 18 cycles of 
PCH amplification (30 sec. denaturation at 94'C 0 sec. 
annealing at SS'C, and 40 sec. elongation at 72 CK * 

30 product was digested with Xbal and Sail and 

a similarly digested DNA. of P DSRo2 (according to the 

methods of Bourdrel et ai. (1993), Protein Exp. & 
metnoos oe do B iochem. 
Puxif., 4:130-140 and Lu et al. (1992). 
Bicphys., 224:150-158). This resulted in plasmid 
KGF/pDSRo2 which placed the human KGF gene between the 
SV40 early promoter and the o-FSH polyadenylat^on 
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sequences. Two clones were picked and BNA sequence 
analysis confirmed construction of the desired vector. 

Two micrograms of KGF/pDSRa2 DNA were then 
linearized with Pvul. Chinese hamster ovary (CHO) 
5 cells, seeded the day before at 0.8 x 10* cells/60 xnzn 
culture dish, were then transfected with the treated DNA 
using a standard calcium phosphate precipitation method 
(Bourdrel et al., supra). Two weeks later, individual 
colonies were picked and transferred into 24-well 
10 plates. The conditioned media was considered serum- free 
when the cells reached, conf luency and aliquots thereof 
were analyzed by Western blotting using a polyclonal 
rabbit antiserum reactive against E. coli-expressed 
human KGF. 

15 • Westerns were performed by running samples 

through 12.5% (w/v) SDS polyacrylamide gels, followed by 
electroblotting for 1 hr. at 400 mA : onto nitrocellulose 
membranes using a semi dry transfer apparatus (Hoefer 
Scientific Instruments, San Francisco, CA) . 20 mM Tris, 

20 150 mM glycine, 20% methanol served as the transfer 
buffer. The nitrocellulose sheets were blocked by 
incubation with 10% normal goat serum in PBS. Rabbit 
anti-serum raided against E. coli-derived KGF was used 
as primary antibody. For use, it was diluted 1/10,000 

25 in 1% normal goat serum in PBS 'and incubated with the 
blocked nitrocellulose sheets for 12 hr. at room 
temperature, after which excess antibody was removed by 
three 30 min. washes in PBS. The nitrocellulose 
membranes were then incubated in 100 mL of 1% normal 

30 goat serum in PBS containing Vectastain w biotinylated 
goat anti-rabbit IgG (secondary antibody, Vector Labs, 
Burlingame, CA) , for 30 minutes at room temperature. 
After three 10 minute washes in PBS, a 30 minute room 
t emp erature incubation was performed in a 100 mL 

35 solution of 1% normal goat serum containing streptavidin 
and biotinylated peroxidase, prepared according to 
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manufacturer's directions (Vector Labs) . Following 
three washes in PBS, KGF cross-reactive material was 
visualized by incubation in a mixture of 60 pL of 30% 
(w/v) H2O2 in 100 mL of PBS and 50 mg of 4-chloronapthol 
5 in 20 mL of methanol. The reaction was stopped by 
rinsing in water after 10 minutes. 

Analysis of the blots revealed that the KGF- 
specific antibody associated with three distinct protein 
bands, two being closely related with molecular weights 
10 of about 25-29 kDa and one with an estimated molecular 
weight of about 17 kDa, as compared to the expected 
molecular weight of approximately 18.8 of the 163 amino 
acid mature protein. Additionally, several high 
expressing clones secreting more than 2.0 mg of rKGF per 
15 liter, as judged by Western analysis, were selected and 
expanded into roller bottles (according to the method of 
Lu et al., supra) to generate large volumes of serum- 
free conditioned medium for purification of KGF by 
cationic exchange chromatography and gel filtration, as 
20 set forth below. 

KGF from 3 L of serum- free conditioned medium 
was purified applying the medium directly to a cation 
exchange column (5 x 24 cm) packed with 450 mL of 
sulfoethyl column of SP-Sepharose Fast Flow (Pharmacia) 
25 pre-equilibrated with 20 mM sodium phosphate, pH 7.5. 
After washing with five column volumes of 20 mM sodium 
phosphate, 0.2 M NaCl, pH 7.5, rKGF was eluted using a 
20 column volume linear gradient of 0.2 to 1.0 M NaCl in 
20 mM sodium phosphate, pH 7.5. 50 mL fractions were 
30 collected with continuous A28O monitoring. KGF protein 
was detected by analyzing aliquots of each fraction by 
SDS-PAGE. SDS-PAGE was performed on an electrophoresis 
system (Novex, San Diego, CA) using precast 14% 
Tris-glycine precast gels (according to the method of 
35 Laemmli (1970), Mature, 222:680-685). Samples were 
mixed with non-reducing SDS sample buffer without 
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heating before loading. The proteins were detected by 
either Coomassie blue or silver staining. Two late- 
eluting peaks were seen to contain protein bands 
corresponding to the 25-29 kDa and 17 kDa bands detected 
5 by Western blot. The fractions containing each of these 
peaks were separately concentrated to a volume of less 
th^n 1.0 mL and subjected to gel filtration. 

The gel f iltrations employed columns of 
Superdexr75 T * resin (HR 10/30, Pharmacia) pre- 

10 equilibrated with PBS, pH 7.2, and calibrated with the 
following known molecular weight standards (BioRad, San 
Francisco, CA) : thyroglobulin (670 kDa), gamma globulin 
(158 kDa), ovalbumin (44 kDa), myoglobin (17 kDa) and 
vitamin B-12 (1.4 kDa). These purification steps 

15 resulted in an approximate 2000-fold purification of 
rKGF, specifically including a 17 kDa and a 30 kDa 
material, as estimated by silver staining. 

In the instance of the higher molecular weight 
material, rKGF eluted as a major symmetrical peak, which 

20 was called KGF-a. Upon SDS-PAGE analysis of a lesser 

amount of this material, 3 jig/lane versus 6 jxg/lane, two 

bands with a 1-2 kDa molecular weight difference were 

resolved. In the instance of the lower molecular weight 

material, termed KGF-b, gel filtration resulted in a 

» 

25 protein preparation having the expected mobility. For 
both KGF-a and KGF-b, the overall yield after 
purification was approximately 30-40%. 

Amino acid sequences from KGF-a and KGF-b were 
also analyzed. These analyses were performed on an 

30 automatic sequencer (Model 477A or 470A, Applied 

Biosystems, Inc., Foster City, CA) equipped with a Model 
12 OA on-line PTH-amino acid analyzer and a Model 900A 
data collection system (according to the method of Lu et 
al. (1991), J. Biol. Chem., 8102-8107) . Edman 

35 sequence analysis of KGF-a revealed a major N- terminal 
sequence of X1-N-D-M-T-P-E-Q-M-A-T-N-V-X2-X3-S- (SEQ ID 
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NO: SI) . A minor sequence starting from the third N- 
texminal amino acid, aspartic acid, was also present in 
1.6% of the total sequenceable protein. x lf x 2 , and X 3 
were the unassigned due to the absence of 
phenylthiohydantoinyl (PTH) amino acid signals during 
sequence analysis. 

Interestingly, N- terminal sequence analysis of 
KGF-b revealed an N-terminal amino acid sequence of 
S-Y-D-Y-M-E-G-G-D-I-R-V- (SEQ ID NO: 52), indicating that 
it is an N-terminally truncated form of KGF that has 
been proteolytically cleaved at the Arg23- Ser 24 peptide 
bond. 

To further characterize purified KGF-a and 
KGF-b, the protein was subjected to glycosidases 
15 (neuraminidase, O-glycanase, and/or N-glycanase) , using 
known techniques (Sasaki ec al. (1987), J. Biol. Chem.. " 
2£2:12059-12076; Takeuchi et al. (1988), J". Biol. Chem.. 
211:3657-3563 ; Zsebo et al. (1990), Cell, £2:195-201). 
These data indicate that KGF-a contains N- and O-linked 
carbohydrates, although the lower molecular weight form 
of KGF-a probably contains only N-linked sugar. 
Glycosidase treatment did not cause molecular weight 
reduction- for KGF-b, indicating that the molecule is 
unglycosylated . 



20 



25 



30 



SXSOTPle. 4: Biological Activity 



Each KGF analog was diluted and assayed for 
biological activity by measuring the [ 3 H] -thymidine 
uptake of Balb/MK cells (according to the method of 
Rubin ee al. (1989), supra). The samples were first 
diluted in a bioassay medium consisting of 50% customer- 
made Eagle's MEM, 50% customer-made F12, 5 jig/mL 
transferrin, 5 ng/mL sodium selenite, 0.0005% HSA and 
35 0.005% Tween 20. KGF samples were then added into 
Falcon primeria 96-well plates seeded with Balb/MK 
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cells. Incorporation of [%] -Thymidine during DNA 
synthesis was measured and converted to input native KGF 
concentration by comparison to a native KGF standard 
curve. Each of the tested analogs' exhibited mitogenic 
5 activity. 

interaction with the KGF receptor was examined 
using isolated KGF receptor membrane preparations 
prepared from Balb/MK mouse epidermal keratinocytes (by 
the procedure described by Massague (19932), j m Biol. 
10 Cbem., 25fi:13614-13620). Specifically, various forms of 
KGF were diluted with 50 mM Tris-HCl, pH 7.5, containing 
0.2% bovine serum albumin so as to range in 
concentration from 0.8 ng to 100 ng per 50 jiL. They 
were individually incubated with the membrane 
15 preparation (75 ng/mL) and 125 x . labeled r# coli-derived 
KGF (1.5 ng) . Receptor binding and competition 
experiments were performed at 4'C for 16 hr., after 
which time samples were taken, centrifuged, and washed 
twice with the above diluent buffer to remove unbound 
and non-specifically bound, labeled KGF. Samples were 
then counted for the remaining radioactivity. 
Competition curves for receptor binding between KGF 
samples and labeled KQF were constructed by plotting ' 
percent uncompetition versus concentrations of each KGF 
25 sample. Radioreceptor assay uncompetition experiments 
indicated that E. coli-derived KGF, KGF-a, and KGF-b 
have similar receptor binding activity. 

While the present invention has been described 
above both generally and in terms of preferred 
embodiments, it is understood' that other variations and 
modifications will occur to those skilled in the art in 
light of the description above. 



20 



30 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Amgen Inc. 



(ii) TITLE OF INVENTION: Method for Purifying Keratinocyte 

Growth Factors 

(iii) NUMBER OF SEQUENCES: 52 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Amgen Inc. 

(BJ STREET: 1840 DeHavilland Drive 

(C) CITY: Thousand Oaks 

(D) STATE: California 

(E) COUNTRY: U.S.A. 

(F) ZIP: 91320-1789 

(v) COMPUTER READABLE FORM: 

(A) ME DIUM T YPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0. Version U.2S 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/487,830 

(B) FILING DATE: 

(CJ CLASSIFICATION: not yet known 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown, 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CAATCTACAA TTCACAGATA GGAAGAGGTC AATGACCTAG GAGTAACAAT CAACTCAAGA 60 

TTCATTTTCA TTATGTTATT CAl^AACACC CGGAGCACTA CACTATAATG CACAAATGGA 120 

TACTGACATG GATCCTGCCA ACTTTGCTCT ACAGATCATG CTTTCACATT ATCTGTCTAG 180 

TGGGTACTAT ATCTTTAGCT TGCAATGACA TGACTCCAGA GCAAATGGCT ACAAATGTGA " 240 

ACTGTTCCAG CCCTGAGCGA CACACAAGAA GTTATGATTA CATGGAAGGA GGGGATATAA 300 
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GAGTGAGAAG ACT C TTCTCT CGAACACAGT GGTACCTGAG GATCGATAAA AGAGGCAAAG 360 

TAAAAGGGAC CCAAGAGATG AAGAATAATT ACAATATCAT GGAAATCAGG ACAGTGGCAG 420 

TTGGAATTGT GGCAATCAAA GGGGTGGAAA GTGAATTCTA TCTTGCAATG AACAAGGAAG 480 

GAAAACTCTA TGCAAAGAAA GAATGCAATG AAGATTGTAA CTTCAAAGAA CTAATTCTGG 540 

AAAACCATTA GAACACATAT GCATCAGCTA AATGGACACA CAACGGAGGG GAAATGTTTG 600 

TTGCCTTAAA TCAAAAGGGG ATTCCTGTAA GAGGAAAAAA AACGAAGAAA GAACAAAAAA 660 

CAGCCCACTT TCTTCCTATG GCAATAACTT AATTGCATAT GGTATATAAA GAACCCAGTT 720 

CCAGCAGGGA GATTTCTTTA AGTGGACTGT T rfCmiTT CTCAAAATTT TCTTTCCTTT 780 

TATTTTTTAG TAATCAAGAA AGGCTGGAAA AACTACTGAA AAACTGATCA AGCTGGACTT 840 

GTGCATTTAT GTTTGTTTTA AG 862 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met His Lys Trp lie Leu Thr Trp lie Leu Pro Thr Leu Leu Tyr Arg 
1 5_ 10 15 

Ser Cys Phe His He He Cys Leu Val Gly Thr He Ser Leu Ala Cys 
20 25 30 

Asn Asp Met Thr Pro Glu Gin Met Ala Thr Asn Val Asn Cys Ser Ser 
35 40 45 

Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly Asp He 
50 55 60 

Arg Val Arg Arg Leu Phe Cys Arg Thr Gin Trp Tyr Leu Arg He Asp 
65 70 75 80 

Lys Arg Gly Lys Val Lys Gly Thr Gin Glu Met Lys Asn Asn Tyr Asn 
85 « 90 95 

He Met Glu He Arg Thr Val Ala Val Gly He Val Ala lie Lys Gly 
100 105 110 

Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys Leu Tyr 
115 120 125 
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Ala Lys Lys 
130 



Glu Cys Asn Glu Asp 
135 



Cys Asn Phe Lys Glu Leu lie Leu 
140 



Glu Asn His 
145 



Tyr Asn Thr Tyr Ala 
150 



Ser Ala Lys Trp Thr His Asn Gly 
155 160 



Gly Glu Met Phe Val Ala Leu Asn Gin Lys Gly He Pro Val Arg Gly 
165 170 175 

Lys Lys Thr Lys Lys Glu Gin Lys Thr Ala His Phe Leu Pro Met Ala 
180 185 190 

He Thr 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 595 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: unknown 
(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATCGATTTGA TTCTAGAAGG AGGAATAACA TATGAAAAAG CGCGCACGTG CTATCGCCAT 60 

TGCTGTGGCT CTGGCAGGTT TCGCAACTAG TGCACACGCG TGCAATGACA TGACTCCAGA 120 

GCAAATGGCT ACAAATGTGA ACTGTTCCAG CCCTGAGCGA CACACAAGAA GTTATGATTA 180 

CATGGAAGGA GGGGATATAA GAGTGAGAAG ACTCTTCTGT CGAACACAGT GGTACCTGAG 240 

GATCGATAAA AGAGGCAAAG TAAAAGGGAC CCAAGAGATG AAGAATAATT ACAATATCAT 300 

GGAAATCAGG ACAGTGGCAG TTGGAATTGT GGCAATCAAA GGGGTGGAAA GTGAATTCTA 360 

TCTTGCAATG AACAAGGAAG GAAAACTCTA TGCAAAGAAA GAATGCAATG AAGATTGTAA 420 

CTTCAAAGAA CTAATTCTGG AAAACCATTA CAACACATAT GCATCAGCTA AATGGACACA 480 

CAACGGAGGG GAAATGTTTG TTGCCTTAAA TCAAAAGGGG ATTCCTGTAA GAGGAAAAAA 540 

AACGAAGAAA GAACAAAAAA CAGCCCACTT TCTTCCTATG GCAATAACTT AATAG 595 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Lys Lys Arg Ala Arg Ala He Ala He Ala Val Ala Leu Ala Glv 
1 5 10 15 

Phe Ala Thr Ser Ala His Ala Cys Asn Asp Met Thr Pro Glu Gin Met 
20 25 30 

Ala Thr Asa Val Asn Cys Ser Ser Pro Glu Arg His Thr Arg Ser Tyr 
35 40 45 

Asp Tyr Met Glu Gly Gly Asp He Arg Val Arg Arg Leu Phe Cys Arc 
50 55 60 

Thr Gin Trp Tyr Leu Arg He Asp Lys Arg Gly Lys Val Lys Gly Thr 
65 70 75 80 

Gin Glu Met Lys Asn Asn Tyr Asn He Met Glu He Arg Thr Val Ala 
85 90 95 

Val Gly He Val Ala He Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala 
100 105 HQ 

Met Asn Lys Glu Gly Lys Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp 
115 120 125 

Cys Asn Phe Lys Glu Leu He Leu Glu Asn His Tyr Asn Thr Tyr Ala 
130 135 140 

Ser Ala Lys Trp Thr His Asn Gly Gly Glu Met Phe Val Ala Leu Asn 
145 150 155 160 

Gin Lys Gly He Pro Val Arg Gly Lys Lys Thr Lys Lys Glu Gin Lys 
165.. 170 175 

Thr Ala His Phe Leu Pro Met Ala He Thr 
180 185 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 499 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

TATGTGCAAT GACATGACTC CAGAGCAAAT GGCTACAAAT GTGAACTGTT CCAGCCCTGA 60 

GCGACACACA AGAAGTTATG ATTACATGGA AGGAGGGGAT ATAAGAGTGA GAAGACTCTT 120 

CTGTCGAACA CAGTGGTACC TGAGGATCGA TAAAAGAGGC AAAGTAAAAG GGACCCAAGA 180 
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GATGAAGAAT AATTACAATA TCATGGAAAT CAGGACAGTG GCAGTTGGAA TTGTGGCAAT 240 

CAAAGGGGTG GAAAGTGAAT TCTATCTTGC AATGAACAAG GAAGGAAAAC TCTATGCAAA 300 

GAAAGAATGC AATGAAGATT GTAACTTCAA AGAACTAATT CTGGAAAACC ATTACAACAC 360 

ATATGCATCA GCTAAATGGA CACACAACGG AGGGGAAATG TTTGTTGCCT TAAATCAAAA 420 

GGGGATTCCT GTAAGAGGAA AAAAAACGAA GAAAGAACAA AAAACAGCCC A CTTTCTTC C 480 

TATGGCAATA ACTTAATAG 499 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Cys Asn Asp Met Tar Pro GIu Gin Met Ala Thr Asn Val Asn Cys 
1 5 10 15 

Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly 
20 25 30 

Asp lie Arg Val Arg Arg Leu Phe Cys Arg Thr Gin Trp Tyr Leu Arg 
35 40 45 

He Asp Lys Arg Gly Lys Val Lys Gly Thr Gin Glu Met Lys Asn Asn 
50 - 55 60 

Tyr Asn He Met Glu He Arg Thr Val Ala Val Gly He Val Ala He 
65 70 75 80 

Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys 
85 90 95 

Leu TVr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu 
100 105 HO 

He Leu Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His 
115 120 125 

Asn Gly Gly Glu Met^Rhe Val Ala Leu Asn Gin Lys Gly He Pro Val 
130 135 140 

Arg Gly Lys Lys Thr Lys Lys Glu Gin Lys Thr Ala His Phe Leu Pro 
145 150 155 ISO 

Met Ala He Thr 
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(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

CAATGACCTA GGAGTAACAA TCAAC 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A J LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
AAAACAAACA TAAATGCACA AGTCCA 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
ACAACGCGTG CAATGACATG ACTCCA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ACAGGATCCT ATTAAGTTAT TGCCATAGGA A 
(2) INFORMATION FOR SEQ ID NO:ll: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ACACATATGT GCAATGACAT GACTCCA 
(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS ; unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CTGCGTATCG ACAAACGCGG CAAAGTCAAG GGCACCC 
(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LQJGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
AAGAGATGAA AAACAACTAC AATATTATGG AAATCCGTAC TGTT 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GCTGTTGGTA TCGTTGCAAT CAAAGGTGTT GAATCTG 
(2) INFORMATION FOR SEQ ID NO: IS: 

(i) SEQUQJCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 15 : 
TCTTGGGTGC CCTTGACTTT GCCGCGTTTG TCGATACGCA GGTAC 45 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQIGTH: 45~ base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
ACAGCAACAG TACGGATTTC CATAATATTG TA GTTGTTTT TCATC 45 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AATTCAGATT CAACACCTTT GATTGCAACG ATACCA 35 
(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AGTTTTGATC TAGAAGGAGG 20 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TCAAAACTGG ATCCTATTAA 20 
(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

s. ^ 

AGTTTTGATC TAGAAGGAGG AATAACATAT GTGCAACGAC ATGACTCCGG AACAGATGGC SO 
TACCAACGTT AACTGCTCCA GCCCGGAACG T 91 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CACACCCGTA GCTACGACTA CATGGAAGGT GGTGACATCC GTGTTCGTCG TCTGTTCTGC 60 

CGTACCCAGT GGTACCTGCG TATCGACAAA 90 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LQflGTH: 90 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CGTGGTAAAG TTAAAGGTAC CCAGGAAATG AAAAACAACT ACAACATCAT GGAAATCCGT 60 
ACTGTTGCTG TTGGTATCGT TGCAATCAAA 90 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEJGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUEN C E DESCRIPTION: SEQ ID NO: 23: 
GGTGTTGAAT CTGAATTCTA CCTGGCAAT(f AACAAAGAAG GTAAACTGTA CGCAAAAAAA 
GAATGCAACG AAGACTGCAA CTTCAAAGAA 



60 
90 
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(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CTGATCCTGG . AAAACCACTA CAACACCTAC GCATCTGCTA AATGGACCCA CAACGGTGGT 60 
GAAATGTTCG TTGCTCTGAA CCAGAAAGGT 90 
(2) INFORMATION FQR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
ATCCCGGTTC GTGGTAAAAA AACCAAAAAA GAACAGAAAA CCGCTCACTT CCTGCCGATG 60 
GCAATCACTT AATAGGATCC A GTTT T GA 88 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
TACGGGTGTG ACGTTCCGGG 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



CTTTACCACG TTTGTCGATA 



20 



(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
ATTCAACACC TTTGATTGCA 20 
(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
CD) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CCAGGATCAG TTCTTTGAAG , 20 

(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQ1GTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



GAACCGGGAT ACCTTTCTGG 



20 
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(2) INFORMATION FOR SEQ ED NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
ATGTCTAATG ATATGACTCC GGAACAGATG GCTACCAACG TTAACTCCTC CTCCCCGGAA 
CGTCACACGC GTTCCTACGA CTACATGGAA GGTGGTGACA TCCGCGTACG TCGTCTGTTC 
TCCCGTACCC AGTGGTACCT GCGTATCGAC AAACGCGGCA AAGTCAAGGG CACCCAAGAG 
ATGAAAAACA ACTACAATAT TATGGAAATC CGTACTGTTG CTGTTGGTAT CGTTGCAATC 
AAAGGTGTTG AATCTGAATT CTACCTGGCA ATGAACAAAG AAGGTAAACT GTACGCAAAA 
AAAGAATGCA ACGAAGACTG CAACTTCAAA GAACTGATCC TGGAAAACCA CTACAACACC 
TACGCATCTG CTAAATGGAC CCACAACGGT GGTGAAATGt' TCGTTGCTCT GAACCAGAAA 
GGTATCCCGG TTCGTGGTAA AAAAACCAAA AAAGAACAGA AAACCGCTCA CTTCCTGCCG 

ATGGCAATCA CTTAA 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LHJGTH: 164 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Met Ser Asn Asp Met Thr Pro Glu Gin Met Ala Thr Asn Val Asn Ser 

1 5 . . 10 

Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly 



60 
120 
180 
240 
300 
360 
420 
480 
495 



20 



Asp He Arg Val Arg Arg Leu Pha Cys Arg Thr Gin Trp Tyr Leu Arg 

35 40 
lie Asp Lys Arg Gly Lys Val Lys Gly Thr Gin Glu Met Lys Asn Asn 



50 



55 
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Tyr Asn lie Met Glu lie Arg Thr Val Ala Val Gly He Val Ala He 
65 70 75 80 

Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Glv Lvs 
85 90 95* r 

Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu 
100 105 no 

He Leu Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His 
115 120 125 

Asn Gly Gly Glu Met Phe Val Ala Leu Asn Gin Lys Gly He Pro Val 
130 135 140 

Arg Gly Lys Lys Thr Lys Lys Glu Gin Lys Thr Ala His Phe Leu Pro 
145 150 155 i 6 o 

Met Ala He Thr 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) Lm?TH: 495 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cBNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

ATGTGCAATG ATATGACTCC TGAACAAATG GCTACCAATG TCAACTGTTC CTCTCCGGAG 60 

CGCCACACCC GGAGTTACGA TTACATGGAA GGTGGGGATA TTCGCGTACG TCGTCTGTTC 120 

» 

TGCCGTACCC AGTGGTACCT GCGTATCGAC AAACGCGGCA AAGTCAAGGG CACCCAAGAG 180 

ATGAAAAACA ACTACAATAT TATGGAAATC CGTA CTGTT G CTGTT G GTAT CGTTGCAATC 240 

AAAGGTGTTG AATCTGAATT CTATCTTGCA ATGAACAAGG AAGGAAAACT CTATGCAAAG 300 

AAAGAATGCA ATGAAGATTG TAACTTCAAA GAACTAATTC TGGAAAACCA TTACAACACA 360 

TATGCATCTG CEAAATGGAC CCACAACGGT GGTGAAATGT TCGTTGCTCT GAACCAGAAA 420 

GGTATCCCTG TTCAAGGTAA GAAAACCAAQ* AAAGAACAGA AAACCGCTCA CTT CCTG CCG 480 

ATGGCAATCA CTTAA 4 gc 
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(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUE NCE CH ARACTERISTICS: 

(A) LENGTH: 164 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Cys Asn Asp Met Thr Pro Glu Gin Met Ala Thr Asn Val Asn Cys 
15 10 is 

Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly 
20 25 30 

Asp He Arg Val Arg Arg Leu Phe Cys Arg Thr Gin Trp Tyr Leu Arg 
35 40 45 

He Asp Lys Arg Gly Lys Val Lys Gly Thr Gin Glu Met Lys Asn Asn 
50 .55 60 

Tyr Asn He Met Glu He Arg Thr Val Ala Val Gly He Val Ala He 
65 70 . 75 80 

Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys 
85 90 95 

Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu 
100 105 110 

He Leu GluT Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His 
115 120 125 

Asn Gly Gly Glu Met Phe Val Ala Leu Asn Gin Lys Gly He Pro Val 
130 135 140 

Gin Gly Lys Lys Thr Lys Lys Glu Gin Lys Thr ..Ala His Phe Leu Pro 
145 150 155 160 

Met Ala He Thr 



(2) INFORMATION FOR SEQ ID NO:35: • 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 49V base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
{D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 35: 

ATGTCTAATG ATATGACTCC GGAACAGATG GCTACCAACG TTAACTCCTC CTCCCCGGAA 60 

CGTCACACGC GTTCCTACGA CTACATGGAA GGTGGTGACA TCCGCGTACG TCGTCTGTTC 120 

TGCCGTACCC AGTGGTACCT GCGTATCGAC AAACGCGGCA AAGTCAAGGG CACCCAAGAG 180 

ATGAAAAACA ACTACAATAT TATGGAAATC CGTACTGTTG CTGTTGGTAT CGTTGCAATC 240 

AAAGGTGTTG AATCTGAATT CTATCTTGCA ATGAACAAGG AAGGAAAACT CTATGCAAAG 300 

AAAGAATGCA ATGAAGATTG TAACTTCAAA GAACTAATTC TGGAAAACCA TTACAACACA 360 

TATGCATCTG CTAAATGGAC CCACAACGGT GGTGAAATGT TCGTTGCTCT GAACCAGAAA 420 

GGTATCCCTG TTCAAGGTAA GAAAACCAAG AAAGAACAGA AAACCGCTCA CTTCCTGCCG 480 

ATGGCAATCA CTTAA 495 
(2) INFORMATION FOR SEQ ID NO:36: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 amino acids 

(B) TYPE: amino acid 

(C) STRANDESNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

Met Ser Asn Asp Met Thr Pro Glu Gin Met Ala Thr Asn Val Asn Ser 
1 5 : " 10 15 

Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly 
20 25 • 30 

Asp lie Arg Val Arg Arg Leu Phe Cys Arg Thr Gin Trp Tyr Leu Arg 
35 40 45 * 

lie Asp Lys Arg Gly Lys Val Lys Gly Thr Gin Glu Met Lys Asn Asn 
50 55 60 

Tyr Asn He Met Glu He Arg Thr Val Ala Val Gly He Val Ala He 
65 70 75 80 

Lys Gly Val Glu Ser Glu Phe, Tyr Leu Ala Met Asn Lys Glu Gly Lys 
85 90 95 

Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu 
100 105 110 

He Leu Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His 
115 120 125 
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Asn Gly Gly Glu MeC Phe Val Ala Leu Asa Gin Lys Gly He Pro Val 
130 135 140 

Gin Gly Lys Lys Thr Lys Lys Glu Gin Lys Thr Ala His Phe Leu Pro 
1« 150 155 160 

Met Ala lie Thr 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 

(B) - TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

ATGTCTTCTC CTGAACGTCA TACGCGTTCC TACGACTACA TGGAAGGTGG TGACATCCGC 60 

GTACGTCGTC TGTTCTGCCG TACCCAGTGG TACCTGCGTA TCGACAAACG CGGCAAAGTC 120 

AAGGGCACCC AAGAGATGAA AAACAACTAC AATATTATGG AAATCCGTAC TGTTGCTGTT 180 

GGTATCGTTG CAATCAAAGG TGTTGAATCT GAATTCTACC TGGCAATGAA CAAAGAAGGT 240 

AAACTGTACG CAAAAAAAGA ATGCAACGAA GACTGCAACT TCAAAGAACT GATCCTGGAA 300 

AACCACTACA ACACCTACGC ATCTGCTAAA TGGACCCACA ACGGTGGTGA AATGTTCGTT 360 

GCTCTGAACC AGAAAGGTAT CCCGGTTCGT GGTAAAAAAA CCAAAAAAGA ACAGAAAACC 420 

GCTCACTTCC TGCCGATGGC AATCACTTAA 450 
(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Mec Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly 
15 10 15 

Gly Asp lie Arg Val Arg Arg Leu Phe Cys Arg Thr Gin Trp Tyr Leu 
20 25 30 



W0 9OT1352 



PCr/US95/13099 



47 - 



Arg lie Asp Lys Arg Gly Lys Val Lys Gly Thr Gin Glu Met Lys Asn 
35 40 45 

Asn Tyr Asn He Met Glu He Arg Thr Val Ala Val Gly He Val Ala 
50 55 60 

Xle Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly 
65 70 75 80 

Lys Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu 
85 90 95 

Leu He Leu Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr 
100 105 110 

His Asn Gly Gly Glu Met Phe Val Ala Leu Asn Gin Lys Gly He Pro 
115 120 125 

Val Arg Gly Lys Lys Thr Lys Lys Glu Gin Lys Thr Ala His Phe Leu 
130 135 140 

Pro Met Ala He Thr 
145 

(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426- base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

ATGTCCTACG ACTACATGGA AGGTGGTGAC ATCCGCGTAC GTCGTCTGTT CTGCCGTACC 60 

CAGTGGTACC TGCGTATCGA CAAACGCGGC AAAGTCAAGG GCACCCAAGA GATGAAAAAC 120 

AACTACAATA TTATGGAAAT CCGTA C T G TT GCTGTT GG TA TCGTTGCAAT CAAAGGTGTT 180 

GAATCTGAAT TCTACCTGJC AATGAACAAA GAAGGTAAAC TGTACGCAAA AAAAGAATGC 240 

AACGAAGACT GCAACTTCAA AGAACTGATC CTGGAAAACC ACTACAACAC CTACGCATCT 300 

GCTAAATGGA CCCACAACGG TGGTGAAATG TTCGTTGCTC TGAACCAGAA AGGTATCCCG 360 

GTTCGTGGTA AAAAAACCAA AAAAGAACAG AAAACCGCTC ACTTCCTGCC GATGGCAATC 420 

ACTTAA 426 
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(2) INFORMATION FOR SEQ ZD NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Met Ser Tyr Asp Tyr Met Glu Gly Gly Asp lie Arg Val Arg Arg Leu 
15 10 15 

Phe Cys Arg Thr Gin Trp Tyr Leu Arg He Asp Lys Arg Gly Lys Val 
20 25 30 

Lys Gly Thr Gin Glu Met Lys Asn Asn Tyr Asn He Met Glu He Arg 
35 40 45 

Thr Val Ala Val Gly He Val Ala He Lys Gly Val Glu Ser Glu Phe 
50 55 60 

Tyr Leu Ala Met Asn Lys Glu Gly Lys Leu Tyr Ala Lys Lys Glu Cys 
65 70 75 80 

Asn Glu Asp Cys Asn Phe Lys Glu Leu He Leu Glu Asn His Tyr Asn 
85 90 95 

Thr Tyr Ala Ser Ala Lys Trp Thr His Asn Gly Gly Glu Met Phe Val 
100 105. 110 

Ala Leu Asn^Gln Lys Gly He Pro Val Arg Gly Lys Lys Thr Lys Lys 
115 120 125 

Glu Gin Lys Thr Ala His Phe Lei; Pro Met Ala He Thr 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDfcA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATGTCCTACG ACTACATGGA AGGTGGTGAC ATCCGCGTAC GTCGTCTGTT CTGCCGTACC 60 

CAGTGGTACC TGCGTATCGA CAAACGCGGC AAAGTCAAGG GCACCCAAGA GATGAAAAAC 120 

AACTACAATA' TTATGGAAAT CCGTACTGTT GCTGTTGGTA TCGTTGCAAT CAAAGGTGTT 180 

GAATCTGAAT TCTATCTTGC AATGAACAAG GAAGGAAAAC TCTATGCAAA GAAAGAATGC 240 

AATGAAGATT GTAACTTCAA AGAACTAATT CTGGAAAACC ATTACAACAC ATATGCATCT 300 

GCTAAATGGA CCCACAACGG TGGTGAAATG TTCGTTGCTC TGAACCAGAA AGGTATCCCT 360 

GTTCAAGGTA AGAAAACCAA GAAAGAACAG AAAACCGCTC ACTTCCTGCC GATGGCAATC 420 

ACTTAA 426 
(2) INFORMATION FOR SEQ ID NO: 42: 

U> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Ser Tyr Asp Tyr Met Glu Gly Gly Asp He Arg Val Arg Arg Leu 
15 10 15 

Phe Cys Arg Thr Gin Trp Tyr Leu Arg He Asp Lys Arg Gly Lys Val 
20 25 30 

Lys Gly Thr Gin Glu Met Lys Asn Asn Tyr Asn He Met Glu He Arg 
35 40 45 

Thr Val Ala Val Gly He Val Ala He Lys Gly Val Glu Ser Glu Phe 
50 55 60 

Tyr Leu Ala Met Asn Lys Glu Gly Lys Leu Tyr Ala Lys Lys Glu Cys 
65 70 75 80 

Asn Glu Asp Cys Asn Phe Lys Glu Leu He Leu Glu Asn His Tyr Asn 
85 90 95 

Thr Tyr Ala Ser Ala Lys Txp thr His Asn Gly Gly Glu Met Phe Val 
100 105 110 

Ala Leu Asn Gin Lys Gly He Pro Val Gin Gly Lys Lys Thr Lys Lys 
115 120 125 

Glu Gin Lys Thr Ala His Phe Leu Pro Met Ala He Thr 
130 135 140 
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(2) INFORMATION FOR SEQ ID 110:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 
GAGCTCACTA GTGTCGACCT GCAG 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: complement (1..24) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 
CTGCAGGTCG ACACTAGTGA GCTC 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: , 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 
CAATCTACAA TTCACAGA 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : unknown 
(DJ TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
TTAAGTTATT GCCATAGG 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
AACAAAGCTT CTACAATTCA CAGATAGGA 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: .CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
AACAAGATCT TAAGTTATTG CCATAGG 
(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
CGGTCTAGAC CACCATGCAC AAATGGATAC TGACATGG 
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(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
GCCGTCGACC TATTAAGTTA TTGCCATAGG AAG 33 
(2) INFORMATION FOR SEQ ID NO:51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDQ3NESS : unknown ; 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 51: 

Xaa Asn Asp Met Thr Pro Glu Gin Met Ala Thr Asn Val Xaa Xaa Ser 
15 10 IS 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid , 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Ser Tyr Asp Tyr Met Glu Gly Gly Asp lie Arg Val 

1 5 ^ 10 
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WHAT IS CLAIMED IS: 

1. A method for purifying a Jceratinocyte growth 
factor (KGF) , the method comprising: 

a) obtaining a solution comprising KGF; 

b) binding KGF from the solution of part (a) to a 
cation exchange resin; 

c) eluting KGF in an eluate solution from the 
. xation exchange resin; 

d) ' passing the eluate solution from part (c) 
through an appropriate molecular we.ight 
exclusion matrix; and 

e) recovering KGF from the molecular weight 
exclusion matrix. 

2. The method according to Claim 1 wherein the 
KGF is produced in procaryotic cells. 

3. The method according to Claim 1 wherein the 
20 KGF is produced in E. coli. 

4. The method according to Claim 1 wherein the KGF 
is produced in mammalian cells. 

25 5 . The method according* to Claim 4 wherein the 

KGF is produced in Chinese hamster ovary cells. 

6. A method for purifying a Jceratinocyte growth 
factor (KGF) , the method comprising: 
30 a) obtaining a solution, comprising KGF; 

b) binding KGF from the solution of part (a) to a 
cation exchange resin; 

c) eluting KGF in an eluate solution from the 
cation exchange resin; 
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d) performing hydrophobic interaction 
chromatography on the eluate solution of part 
(c); and 

e) recovering KGF from the hydrophobic 

5 interaction chromatography step of part (d) . 

7. A method according to Claim 6 further 
comprising oxidation of free sulfhydryl groups in KGF. 

XO 'a. The method according to Claim 6 wherein the 

KGF is produced in procaryotic cells, 

9. The method according to Claim 7 wherein the 
KGF is produced in E. coli. • 
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10. The method according to Claim 6 wherein the 
KGF is produced in mammalian cells. 



11. The method according, to Claim 10 wherein the 
20 KGF is produced in Chinese hamster ovary cells. 
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Figure 1 

human KGF {+ signal sequence) 

I— -QLXGM25 1 I OftXGOIl - _ . 

5 1 CAAXCTACAATTCACAGA 3 ' 3 'CAATGACCXAGGAGXAACAAXCAAC 3* 

5 ' CAATCTACAATTCACACAT^^ 

- TTCATTTTCATTATgTTATTCATGAACACCCG GAGCACTACACT 

™' *~ H -+ 120 

m a k if i 

- TACTGACAT^TCCTGCCAACTTTGCTCT 

tTMILPTLLYRS C T R X X C ft V ^ 
" XGCGXACXAXAXCXXXAGC XXGCAAXGACAXGACXCCAGAOCAAAXGGCXACAAAXGXGA 

G T X-S ft A C M 0 M T 9 E Q M A T iTT~£ "° 
- ACTGrrCCAGCCCTG AGCgACACACAAGAAGT TA^ 

rO TKRGGPXR 
rACCTGAGCAXCGAXAAAAGAGGCAAAG- 
RTQ WTftR 10 XRGXV 

AAXXA CAAXAXCAXGGAAAXCAGGACACXGGCAG- 

RGTOEMXHKrHXMSXRTVAV . 

1 ' 1 * " > ■ l > 480 

GXVAXKGVESErrftAMKKEG 

-<»AAACTCTATGCAAACAAAG 

' " 1 * 1 ' 1 i i ■ ■ f 540 

KX.TAXKECHED C M T K E ft I ft E 

- AAAACCATrACAAdACATATGC AICAGCTAAATGGACA^ 

" * 1 1 ■ ■■■ ' ■ i ■ ■ f $00 

KHTHTTASAXWTHNGGEMFV 

i 

- XTGCCXTAA A XCAAAAGG GGAXXCCXGXAAGAfiGMAAAAAA C GA ra 

H 1 » ■ > i » 660 

A ft U Q K G X ? VRGKK TK KE QKT 

-CAGCCCAC" i ll»rrLLIAXGGCAAXAACTXAAXXGCAXAXGGXAXAXAAAfiAACCCACXX 

1- I > < I > 720 

AHTftPMAIT* 

3 * GGAX ACCGXX AITGAAXX 5' 
I— OftIGO#26 1 




2/15 
(continued) 



■ CCAGCAGGG^TTTCT TTAAGTG^ 

* ' "~ "* H + 




•GTGCATTTATGTTTGTTTTAAG 3» 

»■ 8« 

-CACCTAAAIACAAAOUUU 5 f 
0LXG0#2— 1 
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fiquz* 3 

R5H-KG? 

plaamld OKA ««X xbaX 

a.quMC. 3 • -AIClATTrGATTCTAGAAGGAGGAATAACJIJATQVAAAAG- 

M K R 

W H signal a«qu«ae« vr-r 

"«KSCACGTGCTATCGCCATTGCIGTGG^ 

KARAIAIAVAIiAGFATSAHA- 

S ' OiCSTGCAATOACATGACTCXSUS^^ 

-CHDMIPEQMATMVKCSSPE* 60 
- ^CGACA<aCaACaAGTTATGATTACATSGAAg^ ^ 

RHTRSTDT Me'gGo'xrvrri,/ "° 
*PnZ CZmI 

" ^C^CSAACAC AC TGSTAeCTSAgSATCSATAAAA SAOSCAAASTAAAACGSACCO 

C R T Q W r L R I d' IC R G K » K o T Q. E 1 "° 
HSAlGAASAATAATTAl^TATCATGGAAATCAGSACAGTGSCAa 

mxmhthzmsz'rt v. 'a v g t v a i' 240 
recur 

~ K:AAAGG «"SGAAAgrGAAgTCTATCTT^^ 

K G V B S E r r t a' M H K ' B G K i X A K 

K B C H C N T K E & I L E N H Y M T 

MdtX 

^ ^ATGClTCACCTAAAJGG ACACACXJl Ct ^A SGGG AJU LIC ; ' ri ' UJ T G CCTTAAAgCAAAA- 
TAS AKVTHNG6EMFVALNQX 

- SQGGKT ICCTGTAAfiaCG* A A A X AAA CGA AfiAAAGAACAAAAAACXS CCC A C I 1 iLmc - 
<5 X?VRGRKTKKSQKTABPL P 
HtfX 

-TATOGCAATAACTTAAIAG 3» -piasaid OKA 

1 ' 303 -3«qu«nc« 

MAX?** 
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TLqnxm 4 



KG? 

Hdml 

5 • TATGTSCAATGACATGACTCCASAGCAAA' 



MCMOMTPBQMATHVN C S S P T " 

R H T R S TD T M B • G 6 0 I rTTTTT "° 
JjMX CJ*T 



^TGAAGAATAATTACAATATCATGGAAATCAa^rTGGCA^ 



Q E 
AAT- 



mkhhtiizmexxt vTv TTTTT 240 

EcoRJ 

•c • v r s t , r t a' h h k e G xTTTT 300 

BSBBtX 

* B C M ' R E L I 'l E H H x h'i* 3 " 

* A S A K If T * H o' 0 E M 'f V A L H Q K + ™ 
"^^^^rc^ AW **fiMSMAAAAAAnW A6AA XS> ^CAAAAAACAGCCCACTTTCTTCC- 



6 I P V R O X X T X ' K E Q X T A M P L p * 4M 

sun 

-TATGGCAATAACTTAAIAC 3' 
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substitution of Kpnl to EcoRI sequence to naJce KGF(dsd) 



KpnX 



} QUGQU , | _ot.T«n*i 



•I I 0LIGO»8 f 60 " 



5 • -ArTATSGaAATCCGIACTSTTGCTGTTGGXAXCtWTGC^TCJUUUJGTGITGAATCTS 3 • 
3 -TAAIACCTTTAGCy^TGACAACGACAACCAIAGCAACCT 



•I I 0I.IGOU1 — , 



I H C I R T V A V C I V A I K G V Z S E - 
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Tiquzm € 

KGF (codon optimized) 

I— — -OLIGO#12 1 

5 'AGTTTTGATCTAGAAGGAGG 3 9 

I ■ -OLXGO #14- 



5 * AGTTrTGATCTAGAAGGAGGAATAACATATGTGCAACGACAXGACTCCGGAACAGATGGCT- 
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